Overview

Dataset statistics

Number of variables23
Number of observations2928
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory510.3 KiB
Average record size in memory178.5 B

Variable types

NUM21
CAT2

Reproduction

Analysis started2020-05-09 13:44:35.681042
Analysis finished2020-05-09 13:46:00.656682
Versionpandas-profiling v2.6.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Country has a high cardinality: 183 distinct values High cardinality
under-five_deaths is highly correlated with infant_deathsHigh Correlation
infant_deaths is highly correlated with under-five_deathsHigh Correlation
thinness_5-9_years is highly correlated with thinness_1-19_yearsHigh Correlation
thinness_1-19_years is highly correlated with thinness_5-9_yearsHigh Correlation
Alcohol is highly skewed (γ1 = 23.74597226) Skewed
BMI is highly skewed (γ1 = 21.39812262) Skewed
GDP is highly skewed (γ1 = 21.44293076) Skewed
infant_deaths has 838 (28.6%) zeros Zeros
Alcohol has 55 (1.9%) zeros Zeros
percentage_expenditure has 606 (20.7%) zeros Zeros
Measles has 973 (33.2%) zeros Zeros
under-five_deaths has 775 (26.5%) zeros Zeros
Income_composition_of_resources has 130 (4.4%) zeros Zeros

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count2928
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1467.527322
Minimum0
Maximum2937
Zeros1
Zeros (%)< 0.1%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile146.35
Q1732.75
median1465.5
Q32203.25
95-th percentile2790.65
Maximum2937
Range2937
Interquartile range (IQR)1470.5

Descriptive statistics

Standard deviation848.8254023
Coefficient of variation (CV)0.5784051781
Kurtosis-1.201062795
Mean1467.527322
Median Absolute Deviation (MAD)734.9795455
Skewness0.002318914912
Sum4296920
Variance720504.5637
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 2937.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
1080 1 < 0.1%
 
1076 1 < 0.1%
 
1074 1 < 0.1%
 
1072 1 < 0.1%
 
1070 1 < 0.1%
 
1068 1 < 0.1%
 
1066 1 < 0.1%
 
1064 1 < 0.1%
 
1062 1 < 0.1%
 
Other values (2918) 2918 99.7%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
2937 1 < 0.1%
 
2936 1 < 0.1%
 
2935 1 < 0.1%
 
2934 1 < 0.1%
 
2933 1 < 0.1%
 

Country
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count183
Unique (%)6.2%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
Zimbabwe
 
16
Guinea
 
16
Grenada
 
16
Greece
 
16
Ghana
 
16
Other values (178)
2848
ValueCountFrequency (%) 
Zimbabwe 16 0.5%
 
Guinea 16 0.5%
 
Grenada 16 0.5%
 
Greece 16 0.5%
 
Ghana 16 0.5%
 
Germany 16 0.5%
 
Georgia 16 0.5%
 
Gambia 16 0.5%
 
Gabon 16 0.5%
 
France 16 0.5%
 
Other values (173) 2768 94.5%
 

Length

Max length52
Mean length10.04371585
Min length4
ValueCountFrequency (%) 
Lowercase_Letter 27 48.2%
 
Uppercase_Letter 24 42.9%
 
Dash_Punctuation 1 1.8%
 
Space_Separator 1 1.8%
 
Close_Punctuation 1 1.8%
 
Open_Punctuation 1 1.8%
 
Other_Punctuation 1 1.8%
 
ValueCountFrequency (%) 
Latin 51 91.1%
 
Common 5 8.9%
 
ValueCountFrequency (%) 
ASCII 55 100.0%
 

Year
Real number (ℝ≥0)

Distinct count16
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.5
Minimum2000
Maximum2015
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum2000
5-th percentile2000
Q12003.75
median2007.5
Q32011.25
95-th percentile2015
Maximum2015
Range15
Interquartile range (IQR)7.5

Descriptive statistics

Standard deviation4.610559618
Coefficient of variation (CV)0.002296667307
Kurtosis-1.209427576
Mean2007.5
Median Absolute Deviation (MAD)4
Skewness0
Sum5877960
Variance21.25725999
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2000. 2000.5 2014.5 2015. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2005 183 6.2%
 
2015 183 6.2%
 
2004 183 6.2%
 
2014 183 6.2%
 
2003 183 6.2%
 
2013 183 6.2%
 
2002 183 6.2%
 
2012 183 6.2%
 
2001 183 6.2%
 
2011 183 6.2%
 
Other values (6) 1098 37.5%
 
ValueCountFrequency (%) 
2000 183 6.2%
 
2001 183 6.2%
 
2002 183 6.2%
 
2003 183 6.2%
 
2004 183 6.2%
 
ValueCountFrequency (%) 
2015 183 6.2%
 
2014 183 6.2%
 
2013 183 6.2%
 
2012 183 6.2%
 
2011 183 6.2%
 

Status
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
Developing
2416
Developed
512
ValueCountFrequency (%) 
Developing 2416 82.5%
 
Developed 512 17.5%
 

Length

Max length10
Mean length9.825136612
Min length9
ValueCountFrequency (%) 
Lowercase_Letter 9 90.0%
 
Uppercase_Letter 1 10.0%
 
ValueCountFrequency (%) 
Latin 10 100.0%
 
ValueCountFrequency (%) 
ASCII 10 100.0%
 

Life_expectancy
Real number (ℝ≥0)

Distinct count362
Unique (%)12.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.22493169
Minimum36.3
Maximum89
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum36.3
5-th percentile51.4
Q163.1
median72.1
Q375.7
95-th percentile82
Maximum89
Range52.7
Interquartile range (IQR)12.6

Descriptive statistics

Standard deviation9.523867488
Coefficient of variation (CV)0.1375785754
Kurtosis-0.2344773942
Mean69.22493169
Median Absolute Deviation (MAD)7.791540879
Skewness-0.6386047359
Sum202690.6
Variance90.70405193
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[36.3 44.15 51.05 57.25 59.95 ... 81.05 82.05 82.95 83.05 89. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
73 45 1.5%
 
75 33 1.1%
 
78 31 1.1%
 
73.6 28 1.0%
 
76 25 0.9%
 
73.9 25 0.9%
 
81 25 0.9%
 
74.7 24 0.8%
 
74.5 24 0.8%
 
74.2 23 0.8%
 
Other values (352) 2645 90.3%
 
ValueCountFrequency (%) 
36.3 1 < 0.1%
 
39 1 < 0.1%
 
41 1 < 0.1%
 
41.5 1 < 0.1%
 
42.3 1 < 0.1%
 
ValueCountFrequency (%) 
89 11 0.4%
 
88 10 0.3%
 
87 9 0.3%
 
86 15 0.5%
 
85 12 0.4%
 

Adult_Mortality
Real number (ℝ≥0)

Distinct count425
Unique (%)14.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean164.7964481
Minimum1
Maximum723
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum1
5-th percentile13
Q174
median144
Q3228
95-th percentile398.3
Maximum723
Range722
Interquartile range (IQR)154

Descriptive statistics

Standard deviation124.292079
Coefficient of variation (CV)0.754215764
Kurtosis1.748860208
Mean164.7964481
Median Absolute Deviation (MAD)95.36008559
Skewness1.174369488
Sum482524
Variance15448.5209
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 5.5 11.5 17.5 ... 398. 411.5 438.5 497.5 723. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12 34 1.2%
 
14 30 1.0%
 
16 29 1.0%
 
138 25 0.9%
 
11 25 0.9%
 
19 23 0.8%
 
144 22 0.8%
 
13 21 0.7%
 
15 21 0.7%
 
17 21 0.7%
 
Other values (415) 2677 91.4%
 
ValueCountFrequency (%) 
1 12 0.4%
 
2 8 0.3%
 
3 6 0.2%
 
4 4 0.1%
 
5 2 0.1%
 
ValueCountFrequency (%) 
723 1 < 0.1%
 
717 1 < 0.1%
 
715 1 < 0.1%
 
699 1 < 0.1%
 
693 1 < 0.1%
 

infant_deaths
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count209
Unique (%)7.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.40744536
Minimum0
Maximum1800
Zeros838
Zeros (%)28.6%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median3
Q322
95-th percentile94.65
Maximum1800
Range1800
Interquartile range (IQR)22

Descriptive statistics

Standard deviation118.1144496
Coefficient of variation (CV)3.884392399
Kurtosis115.6574795
Mean30.40744536
Median Absolute Deviation (MAD)40.79144994
Skewness9.771044493
Sum89033
Variance13951.0232
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 5.000e-01 1.500e+00 3.500e+00 4.500e+00 ... 2.505e+02 3.595e+02 3.715e+02 5.750e+02 1.800e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 838 28.6%
 
1 342 11.7%
 
2 203 6.9%
 
3 175 6.0%
 
4 96 3.3%
 
8 57 1.9%
 
7 53 1.8%
 
9 48 1.6%
 
10 48 1.6%
 
6 46 1.6%
 
Other values (199) 1022 34.9%
 
ValueCountFrequency (%) 
0 838 28.6%
 
1 342 11.7%
 
2 203 6.9%
 
3 175 6.0%
 
4 96 3.3%
 
ValueCountFrequency (%) 
1800 2 0.1%
 
1700 2 0.1%
 
1600 1 < 0.1%
 
1500 2 0.1%
 
1400 1 < 0.1%
 

Alcohol
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count1115
Unique (%)38.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.22736081
Minimum0
Maximum455
Zeros55
Zeros (%)1.9%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile0.01
Q10.8
median3.675
Q37.84
95-th percentile12.3395
Maximum455
Range455
Interquartile range (IQR)7.04

Descriptive statistics

Standard deviation12.2927326
Coefficient of variation (CV)2.351613568
Kurtosis740.2675999
Mean5.22736081
Median Absolute Deviation (MAD)4.287240926
Skewness23.74597226
Sum15305.71245
Variance151.1112749
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 5.00000000e-03 1.50000000e-02 9.50000000e-02 4.95000000e-01 ... 1.22850000e+01 1.35450000e+01 1.51650000e+01 3.63797496e+01 4.55000000e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.01 280 9.6%
 
0 55 1.9%
 
0.5 28 1.0%
 
1 19 0.6%
 
0.03 15 0.5%
 
0.04 13 0.4%
 
0.02 12 0.4%
 
0.09 12 0.4%
 
0.21 10 0.3%
 
1.5 10 0.3%
 
Other values (1105) 2474 84.5%
 
ValueCountFrequency (%) 
0 55 1.9%
 
0.01 280 9.6%
 
0.02 12 0.4%
 
0.03 15 0.5%
 
0.04 13 0.4%
 
ValueCountFrequency (%) 
455 1 < 0.1%
 
263.653836 1 < 0.1%
 
241.5 1 < 0.1%
 
176 1 < 0.1%
 
118 1 < 0.1%
 

percentage_expenditure
Real number (ℝ≥0)

ZEROS
Distinct count2323
Unique (%)79.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean740.321185
Minimum0
Maximum19479.91161
Zeros606
Zeros (%)20.7%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14.853963995
median65.61145482
Q3442.6143215
95-th percentile4507.913607
Maximum19479.91161
Range19479.91161
Interquartile range (IQR)437.7603575

Descriptive statistics

Standard deviation1990.930605
Coefficient of variation (CV)2.689279525
Kurtosis26.47582908
Mean740.321185
Median Absolute Deviation (MAD)1019.346596
Skewness4.643789672
Sum2167660.43
Variance3963804.673
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 4.99360950e-02 1.18551412e+01 4.77767119e+01 8.12159390e+01 ... 2.28423697e+03 3.88244131e+03 8.39206517e+03 1.19676540e+04 1.94799116e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 606 20.7%
 
345.9044258 1 < 0.1%
 
2698.01817 1 < 0.1%
 
3.43334364 1 < 0.1%
 
8.758214538 1 < 0.1%
 
5.103249438 1 < 0.1%
 
70.27113179 1 < 0.1%
 
6164.455402 1 < 0.1%
 
0.962497052 1 < 0.1%
 
253.4022338 1 < 0.1%
 
Other values (2313) 2313 79.0%
 
ValueCountFrequency (%) 
0 606 20.7%
 
0.09987219 1 < 0.1%
 
0.108055973 1 < 0.1%
 
0.27564826 1 < 0.1%
 
0.328418056 1 < 0.1%
 
ValueCountFrequency (%) 
19479.91161 1 < 0.1%
 
19099.04506 1 < 0.1%
 
18961.3486 1 < 0.1%
 
18822.86732 1 < 0.1%
 
18379.32974 1 < 0.1%
 

Hepatitis_B
Real number (ℝ≥0)

Distinct count605
Unique (%)20.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean609.6203468
Minimum0
Maximum106102.7409
Zeros24
Zeros (%)0.8%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile8.132091554
Q176
median93
Q398
95-th percentile2225.2728
Maximum106102.7409
Range106102.7409
Interquartile range (IQR)22

Descriptive statistics

Standard deviation3930.909022
Coefficient of variation (CV)6.44812635
Kurtosis385.6755466
Mean609.6203468
Median Absolute Deviation (MAD)957.0549138
Skewness17.552033
Sum1784968.375
Variance15452045.74
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 5.00000000e-01 5.93941724e+00 6.01654178e+00 6.75000000e+00 ... 3.08891683e+03 6.07383210e+03 1.25062295e+04 3.10719198e+04 1.06102741e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
99 237 8.1%
 
98 209 7.1%
 
96 166 5.7%
 
97 154 5.3%
 
95 149 5.1%
 
94 127 4.3%
 
93 101 3.4%
 
92 92 3.1%
 
91 75 2.6%
 
89 71 2.4%
 
Other values (595) 1547 52.8%
 
ValueCountFrequency (%) 
0 24 0.8%
 
1 2 0.1%
 
1.767286755 1 < 0.1%
 
1.920614288 1 < 0.1%
 
2 4 0.1%
 
ValueCountFrequency (%) 
106102.7409 1 < 0.1%
 
91242.5 1 < 0.1%
 
84061.41549 1 < 0.1%
 
70644.09775 1 < 0.1%
 
40061.5 1 < 0.1%
 

Measles
Real number (ℝ≥0)

ZEROS
Distinct count958
Unique (%)32.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2427.855874
Minimum0
Maximum212183
Zeros973
Zeros (%)33.2%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median17
Q3362.25
95-th percentile9994.05
Maximum212183
Range212183
Interquartile range (IQR)362.25

Descriptive statistics

Standard deviation11485.97094
Coefficient of variation (CV)4.73091136
Kurtosis114.4679785
Mean2427.855874
Median Absolute Deviation (MAD)3935.175074
Skewness9.425290043
Sum7108762
Variance131927528.4
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000e+00 5.00000e-01 1.50000e+00 3.50000e+00 1.15000e+01 ... 7.87500e+03 1.28445e+04 2.49720e+04 7.19540e+04 2.12183e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 973 33.2%
 
1 104 3.6%
 
2 68 2.3%
 
3 44 1.5%
 
4 33 1.1%
 
6 29 1.0%
 
7 28 1.0%
 
5 25 0.9%
 
8 24 0.8%
 
9 22 0.8%
 
Other values (948) 1578 53.9%
 
ValueCountFrequency (%) 
0 973 33.2%
 
1 104 3.6%
 
2 68 2.3%
 
3 44 1.5%
 
4 33 1.1%
 
ValueCountFrequency (%) 
212183 1 < 0.1%
 
182485 1 < 0.1%
 
168107 1 < 0.1%
 
141258 1 < 0.1%
 
133802 1 < 0.1%
 

BMI
Real number (ℝ≥0)

SKEWED
Distinct count624
Unique (%)21.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.55420082
Minimum1
Maximum4832
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum1
5-th percentile5.2
Q119.4
median43.5
Q356.3
95-th percentile65.065
Maximum4832
Range4831
Interquartile range (IQR)36.9

Descriptive statistics

Standard deviation159.6296548
Coefficient of variation (CV)3.356793975
Kurtosis528.016447
Mean47.55420082
Median Absolute Deviation (MAD)27.27043917
Skewness21.39812262
Sum139238.7
Variance25481.62669
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.0000e+00 2.0500e+00 2.9500e+00 5.0500e+00 7.0000e+00 ... 6.6650e+01 6.9650e+01 7.7350e+01 4.2225e+02 4.8320e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
58.5 18 0.6%
 
57 16 0.5%
 
55.8 16 0.5%
 
54.2 15 0.5%
 
59.9 15 0.5%
 
59.3 14 0.5%
 
56.5 13 0.4%
 
58.1 13 0.4%
 
52.8 13 0.4%
 
59.4 13 0.4%
 
Other values (614) 2782 95.0%
 
ValueCountFrequency (%) 
1 1 < 0.1%
 
1.4 2 0.1%
 
1.8 1 < 0.1%
 
1.9 1 < 0.1%
 
2 1 < 0.1%
 
ValueCountFrequency (%) 
4832 1 < 0.1%
 
4306 1 < 0.1%
 
2853.5 1 < 0.1%
 
2317.5 1 < 0.1%
 
2242.5 1 < 0.1%
 

under-five_deaths
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count252
Unique (%)8.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.17930328
Minimum0
Maximum2500
Zeros775
Zeros (%)26.5%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q328
95-th percentile138
Maximum2500
Range2500
Interquartile range (IQR)28

Descriptive statistics

Standard deviation160.7005471
Coefficient of variation (CV)3.809938395
Kurtosis109.3884348
Mean42.17930328
Median Absolute Deviation (MAD)57.24749661
Skewness9.479622923
Sum123501
Variance25824.66582
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 5.000e-01 1.500e+00 4.500e+00 1.250e+01 ... 3.370e+02 4.570e+02 4.665e+02 9.395e+02 2.500e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 775 26.5%
 
1 361 12.3%
 
2 163 5.6%
 
4 161 5.5%
 
3 129 4.4%
 
12 53 1.8%
 
8 49 1.7%
 
6 48 1.6%
 
10 47 1.6%
 
5 44 1.5%
 
Other values (242) 1098 37.5%
 
ValueCountFrequency (%) 
0 775 26.5%
 
1 361 12.3%
 
2 163 5.6%
 
3 129 4.4%
 
4 161 5.5%
 
ValueCountFrequency (%) 
2500 1 < 0.1%
 
2400 1 < 0.1%
 
2300 1 < 0.1%
 
2200 1 < 0.1%
 
2100 1 < 0.1%
 

Polio
Real number (ℝ≥0)

Distinct count91
Unique (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean82.1534597
Minimum3
Maximum99
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum3
5-th percentile9
Q177
median93
Q397
95-th percentile99
Maximum99
Range96
Interquartile range (IQR)20

Descriptive statistics

Standard deviation23.87773499
Coefficient of variation (CV)0.2906479542
Kurtosis3.474033021
Mean82.1534597
Median Absolute Deviation (MAD)17.02392519
Skewness-2.044190865
Sum240545.33
Variance570.1462285
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 3. 3.9375 4.0575 6.5 8.5 ... 91.5 93.5 97.5 98.5 99. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
99 373 12.7%
 
98 254 8.7%
 
97 205 7.0%
 
96 205 7.0%
 
95 180 6.1%
 
94 159 5.4%
 
93 120 4.1%
 
92 96 3.3%
 
91 88 3.0%
 
9 70 2.4%
 
Other values (81) 1178 40.2%
 
ValueCountFrequency (%) 
3 7 0.2%
 
3.63 1 < 0.1%
 
3.66 1 < 0.1%
 
3.875 1 < 0.1%
 
4 11 0.4%
 
ValueCountFrequency (%) 
99 373 12.7%
 
98 254 8.7%
 
97 205 7.0%
 
96 205 7.0%
 
95 180 6.1%
 

Total_expenditure
Real number (ℝ≥0)

Distinct count897
Unique (%)30.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.4497097
Minimum0.37
Maximum99
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum0.37
5-th percentile1.9835
Q14.36
median5.94
Q38.18
95-th percentile81.825
Maximum99
Range98.63
Interquartile range (IQR)3.82

Descriptive statistics

Standard deviation20.63972839
Coefficient of variation (CV)1.80264207
Kurtosis11.14558805
Mean11.4497097
Median Absolute Deviation (MAD)10.28756334
Skewness3.544407992
Sum33524.75
Variance425.9983881
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.37 1.11 3.165 3.995 4.105 ... 17.55 85.75 94.75 98.75 99. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
99 24 0.8%
 
97 18 0.6%
 
4.6 15 0.5%
 
95 14 0.5%
 
6.7 12 0.4%
 
5.6 11 0.4%
 
5.25 10 0.3%
 
5.64 10 0.3%
 
5.3 10 0.3%
 
9.1 10 0.3%
 
Other values (887) 2794 95.4%
 
ValueCountFrequency (%) 
0.37 1 < 0.1%
 
0.65 1 < 0.1%
 
0.74 1 < 0.1%
 
0.76 1 < 0.1%
 
0.92 1 < 0.1%
 
ValueCountFrequency (%) 
99 24 0.8%
 
98.5 4 0.1%
 
98 8 0.3%
 
97.5 6 0.2%
 
97 18 0.6%
 

Diphtheria
Real number (ℝ≥0)

Distinct count99
Unique (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81.84961749
Minimum1.68
Maximum99
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum1.68
5-th percentile9
Q178
median93
Q397
95-th percentile99
Maximum99
Range97.32
Interquartile range (IQR)19

Descriptive statistics

Standard deviation24.34386033
Coefficient of variation (CV)0.2974217971
Kurtosis3.234961656
Mean81.84961749
Median Absolute Deviation (MAD)17.29726895
Skewness-2.019167689
Sum239655.68
Variance592.6235356
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1.68 3.855 4.0825 6.5 11.125 ... 81.5 91.5 94.5 98.5 99. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
99 347 11.9%
 
98 253 8.6%
 
97 205 7.0%
 
95 200 6.8%
 
96 199 6.8%
 
94 149 5.1%
 
93 120 4.1%
 
92 100 3.4%
 
91 91 3.1%
 
89 76 2.6%
 
Other values (89) 1188 40.6%
 
ValueCountFrequency (%) 
1.68 1 < 0.1%
 
1.925 1 < 0.1%
 
2 1 < 0.1%
 
3 4 0.1%
 
3.71 1 < 0.1%
 
ValueCountFrequency (%) 
99 347 11.9%
 
98 253 8.6%
 
97 205 7.0%
 
96 199 6.8%
 
95 200 6.8%
 

HIV/AIDS
Real number (ℝ≥0)

Distinct count200
Unique (%)6.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.747711749
Minimum0.1
Maximum50.6
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum0.1
5-th percentile0.1
Q10.1
median0.1
Q30.8
95-th percentile8.565
Maximum50.6
Range50.5
Interquartile range (IQR)0.7

Descriptive statistics

Standard deviation5.08554241
Coefficient of variation (CV)2.909829046
Kurtosis34.76639831
Mean1.747711749
Median Absolute Deviation (MAD)2.458204116
Skewness5.386623166
Sum5117.3
Variance25.8627416
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0.1 0.15 0.35 0.45 0.95 ... 5.35 9.05 16.05 34.7 50.6 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.1 1771 60.5%
 
0.2 124 4.2%
 
0.3 115 3.9%
 
0.4 69 2.4%
 
0.5 42 1.4%
 
0.6 35 1.2%
 
0.8 32 1.1%
 
0.9 32 1.1%
 
0.7 29 1.0%
 
1.5 21 0.7%
 
Other values (190) 658 22.5%
 
ValueCountFrequency (%) 
0.1 1771 60.5%
 
0.2 124 4.2%
 
0.3 115 3.9%
 
0.4 69 2.4%
 
0.5 42 1.4%
 
ValueCountFrequency (%) 
50.6 1 < 0.1%
 
50.3 1 < 0.1%
 
49.9 1 < 0.1%
 
49.1 1 < 0.1%
 
48.8 1 < 0.1%
 

GDP
Real number (ℝ≥0)

SKEWED
Distinct count2675
Unique (%)91.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35290.45329
Minimum0.1333333333
Maximum12813813.05
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum0.1333333333
5-th percentile0.9
Q1216.4565635
median1220.95193
Q35121.931677
95-th percentile39873.03607
Maximum12813813.05
Range12813812.92
Interquartile range (IQR)4905.475114

Descriptive statistics

Standard deviation513617.1812
Coefficient of variation (CV)14.55399785
Kurtosis487.2352765
Mean35290.45329
Median Absolute Deviation (MAD)59324.15613
Skewness21.44293076
Sum103330447.2
Variance2.638026089e+11
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.33333333e-01 1.50000000e-01 2.33333333e-01 5.50000000e-01 6.16666667e-01 1.53333333e+00 1.53333333e+00 1.28138130e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.5666666667 20 0.7%
 
0.3 19 0.6%
 
0.6 19 0.6%
 
0.1333333333 14 0.5%
 
0.3333333333 13 0.4%
 
0.4666666667 11 0.4%
 
1.5 10 0.3%
 
1.233333333 10 0.3%
 
0.9 10 0.3%
 
1.166666667 9 0.3%
 
Other values (2665) 2793 95.4%
 
ValueCountFrequency (%) 
0.1333333333 14 0.5%
 
0.1666666667 2 0.1%
 
0.3 19 0.6%
 
0.3333333333 13 0.4%
 
0.4666666667 11 0.4%
 
ValueCountFrequency (%) 
12813813.05 1 < 0.1%
 
12469649.55 1 < 0.1%
 
12125824.55 1 < 0.1%
 
11782706.55 1 < 0.1%
 
9367493.55 1 < 0.1%
 

Population
Real number (ℝ≥0)

Distinct count2676
Unique (%)91.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9957659.106
Minimum0.1666666667
Maximum1293859294
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum0.1666666667
5-th percentile1.633333333
Q113717.5
median545917.5
Q34592776.75
95-th percentile41358873.85
Maximum1293859294
Range1293859294
Interquartile range (IQR)4579059.25

Descriptive statistics

Standard deviation54164922.93
Coefficient of variation (CV)5.439523723
Kurtosis378.7705864
Mean9957659.106
Median Absolute Deviation (MAD)14433589.82
Skewness17.90694885
Sum2.915602586e+10
Variance2.933838877e+15
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.66666667e-01 2.00000000e-01 3.66666667e-01 6.50000000e-01 8.16666667e-01 ... 2.98692945e+07 4.95186626e+07 8.58452450e+07 2.56646614e+08 1.29385929e+09], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.5 19 0.6%
 
1.1 17 0.6%
 
1.033333333 16 0.5%
 
0.1666666667 14 0.5%
 
0.5666666667 13 0.4%
 
2.366666667 10 0.3%
 
2.9 10 0.3%
 
1.7 10 0.3%
 
0.8333333333 10 0.3%
 
2.233333333 9 0.3%
 
Other values (2666) 2800 95.6%
 
ValueCountFrequency (%) 
0.1666666667 14 0.5%
 
0.2333333333 2 0.1%
 
0.5 19 0.6%
 
0.5666666667 13 0.4%
 
0.7333333333 1 < 0.1%
 
ValueCountFrequency (%) 
1293859294 1 < 0.1%
 
1179681239 1 < 0.1%
 
1161977719 1 < 0.1%
 
1144118674 1 < 0.1%
 
1126135777 1 < 0.1%
 

thinness_1-19_years
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count232
Unique (%)7.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean79412.23854
Minimum0.1
Maximum25158608.83
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum0.1
5-th percentile0.6
Q11.6
median3.4
Q37.3
95-th percentile14.7
Maximum25158608.83
Range25158608.73
Interquartile range (IQR)5.7

Descriptive statistics

Standard deviation1202290.776
Coefficient of variation (CV)15.13986759
Kurtosis333.1783789
Mean79412.23854
Median Absolute Deviation (MAD)157116.4416
Skewness17.92889476
Sum232519034.5
Variance1.445503111e+12
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000000e-01 3.50000000e-01 4.50000000e-01 6.50000000e-01 2.35000000e+00 ... 1.98500000e+01 2.67500000e+01 2.76000000e+01 2.59223249e+06 2.51586088e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 74 2.5%
 
1.9 65 2.2%
 
0.8 64 2.2%
 
0.7 63 2.2%
 
1.2 62 2.1%
 
2.1 61 2.1%
 
1.5 60 2.0%
 
2.2 58 2.0%
 
0.9 57 1.9%
 
2 57 1.9%
 
Other values (222) 2307 78.8%
 
ValueCountFrequency (%) 
0.1 23 0.8%
 
0.2 39 1.3%
 
0.3 32 1.1%
 
0.4 5 0.2%
 
0.5 35 1.2%
 
ValueCountFrequency (%) 
25158608.83 1 < 0.1%
 
24566612.16 1 < 0.1%
 
23444876.15 1 < 0.1%
 
22923975.49 1 < 0.1%
 
21970330.81 1 < 0.1%
 

thinness_5-9_years
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count239
Unique (%)8.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39708.55011
Minimum0.1
Maximum12579304.66
Zeros0
Zeros (%)0.0%
Memory size23.0 KiB

Quantile statistics

Minimum0.1
5-th percentile0.5
Q11.6
median3.4
Q37.3
95-th percentile15.065
Maximum12579304.66
Range12579304.56
Interquartile range (IQR)5.7

Descriptive statistics

Standard deviation601145.2406
Coefficient of variation (CV)15.13893706
Kurtosis333.1784456
Mean39708.55011
Median Absolute Deviation (MAD)78558.17414
Skewness17.92889697
Sum116266634.7
Variance3.613756003e+11
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000000e-01 2.50000000e-01 4.50000000e-01 2.55000000e+00 3.95000000e+00 ... 2.25000000e+01 2.73500000e+01 2.85500000e+01 1.29611647e+06 1.25793047e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.9 69 2.4%
 
1.1 67 2.3%
 
0.5 63 2.2%
 
1.9 63 2.2%
 
1 62 2.1%
 
2.1 61 2.1%
 
1.3 59 2.0%
 
1.5 57 1.9%
 
1.7 55 1.9%
 
0.6 54 1.8%
 
Other values (229) 2318 79.2%
 
ValueCountFrequency (%) 
0.1 31 1.1%
 
0.2 45 1.5%
 
0.3 25 0.9%
 
0.4 17 0.6%
 
0.5 63 2.2%
 
ValueCountFrequency (%) 
12579304.66 1 < 0.1%
 
12283306.32 1 < 0.1%
 
11722438.31 1 < 0.1%
 
11461987.97 1 < 0.1%
 
10985165.63 1 < 0.1%
 

Income_composition_of_resources
Real number (ℝ≥0)

ZEROS
Distinct count689
Unique (%)23.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8222660519
Minimum0
Maximum12.3
Zeros130
Zeros (%)4.4%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile0.291
Q10.5
median0.6845
Q30.792
95-th percentile0.92965
Maximum12.3
Range12.3
Interquartile range (IQR)0.292

Descriptive statistics

Standard deviation1.125777308
Coefficient of variation (CV)1.369115635
Kurtosis41.18735401
Mean0.8222660519
Median Absolute Deviation (MAD)0.3903906364
Skewness6.093125988
Sum2407.595
Variance1.267374547
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.1265 0.254 0.2975 0.3805 ... 1.05 3.65 4.95 8.2 12.3 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 130 4.4%
 
0.6 19 0.6%
 
0.7 18 0.6%
 
0.5 18 0.6%
 
1 14 0.5%
 
0.739 13 0.4%
 
0.636 12 0.4%
 
0.714 12 0.4%
 
0.703 11 0.4%
 
0.86 11 0.4%
 
Other values (679) 2670 91.2%
 
ValueCountFrequency (%) 
0 130 4.4%
 
0.253 1 < 0.1%
 
0.255 1 < 0.1%
 
0.261 1 < 0.1%
 
0.266 1 < 0.1%
 
ValueCountFrequency (%) 
12.3 1 < 0.1%
 
12.1 1 < 0.1%
 
11.9 1 < 0.1%
 
11.7 1 < 0.1%
 
11.5 1 < 0.1%
 

Schooling
Real number (ℝ≥0)

Distinct count188
Unique (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.57305328
Minimum0
Maximum20.7
Zeros26
Zeros (%)0.9%
Memory size23.0 KiB

Quantile statistics

Minimum0
5-th percentile4.635
Q19.6
median12.1
Q314.1
95-th percentile16.8
Maximum20.7
Range20.7
Interquartile range (IQR)4.5

Descriptive statistics

Standard deviation3.782065454
Coefficient of variation (CV)0.3267992779
Kurtosis0.7460197911
Mean11.57305328
Median Absolute Deviation (MAD)2.909369261
Skewness-0.7530717985
Sum33885.9
Variance14.3040191
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.65 4.85 7.05 9.95 ... 15.95 16.55 17.65 19.25 20.7 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12.9 58 2.0%
 
13.3 52 1.8%
 
12.5 49 1.7%
 
12.8 46 1.6%
 
12.3 45 1.5%
 
12.6 43 1.5%
 
11.9 42 1.4%
 
12.4 42 1.4%
 
10.7 41 1.4%
 
11.7 41 1.4%
 
Other values (178) 2469 84.3%
 
ValueCountFrequency (%) 
0 26 0.9%
 
0.5 15 0.5%
 
0.6 16 0.5%
 
0.7 1 < 0.1%
 
1 14 0.5%
 
ValueCountFrequency (%) 
20.7 1 < 0.1%
 
20.6 1 < 0.1%
 
20.5 1 < 0.1%
 
20.4 3 0.1%
 
20.3 4 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

df_indexCountryYearStatusLife_expectancyAdult_Mortalityinfant_deathsAlcoholpercentage_expenditureHepatitis_BMeaslesBMIunder-five_deathsPolioTotal_expenditureDiphtheriaHIV/AIDSGDPPopulationthinness_1-19_yearsthinness_5-9_yearsIncome_composition_of_resourcesSchooling
00Afghanistan2015.0Developing65.0263.062.00.0171.27962465.01154.019.183.06.08.1665.00.1584.25921033736494.017.217.30.47910.1
11Afghanistan2014.0Developing59.9271.064.00.0173.52358262.0492.018.686.058.08.1862.00.1612.696514327582.017.517.50.47610.0
22Afghanistan2013.0Developing59.9268.066.00.0173.21924364.0430.018.189.062.08.1364.00.1631.74497631731688.017.717.70.4709.9
33Afghanistan2012.0Developing59.5272.069.00.0178.18421567.02787.017.693.067.08.5267.00.1669.9590003696958.017.918.00.4639.8
44Afghanistan2011.0Developing59.2275.071.00.017.09710968.03013.017.297.068.07.8768.00.163.5372312978599.018.218.20.4549.5
55Afghanistan2010.0Developing58.8279.074.00.0179.67936766.01989.016.7102.066.09.2066.00.1553.3289402883167.018.418.40.4489.2
66Afghanistan2009.0Developing58.6281.077.00.0156.76221763.02861.016.2106.063.09.4263.00.1445.893298284331.018.618.70.4348.9
77Afghanistan2008.0Developing58.1287.080.00.0325.87392564.01599.015.7110.064.08.3364.00.1373.3611162729431.018.818.90.4338.7
88Afghanistan2007.0Developing57.5295.082.00.0210.91015663.01141.015.2113.063.06.7363.00.1369.83579626616792.019.019.10.4158.4
99Afghanistan2006.0Developing57.3295.084.00.0317.17151864.01990.014.7116.058.07.4358.00.1272.5637702589345.019.219.30.4058.1

Last rows

df_indexCountryYearStatusLife_expectancyAdult_Mortalityinfant_deathsAlcoholpercentage_expenditureHepatitis_BMeaslesBMIunder-five_deathsPolioTotal_expenditureDiphtheriaHIV/AIDSGDPPopulationthinness_1-19_yearsthinness_5-9_yearsIncome_composition_of_resourcesSchooling
29182928Zimbabwe2009.0Developing50.0587.030.04.641.04002173.0853.029.045.069.06.2673.018.165.8241211381599.07.57.40.4199.9
29192929Zimbabwe2008.0Developing48.2632.030.03.5620.84342975.00.028.646.075.04.9675.020.5325.67857313558469.07.87.80.4219.7
29202930Zimbabwe2007.0Developing46.667.029.03.8829.81456672.0242.028.246.073.04.4773.023.7396.9982171332999.08.28.20.4149.6
29212931Zimbabwe2006.0Developing45.47.028.04.5734.26216968.0212.027.945.071.05.127.026.8414.79623213124267.08.68.60.4089.5
29222932Zimbabwe2005.0Developing44.6717.028.04.148.71740965.0420.027.543.069.06.4468.030.3444.765750129432.09.09.00.4069.3
29232933Zimbabwe2004.0Developing44.3723.027.04.360.00000068.031.027.142.067.07.1365.033.6454.36665412777511.09.49.40.4079.2
29242934Zimbabwe2003.0Developing44.5715.026.04.060.0000007.0998.026.741.07.06.5268.036.7453.35115512633897.09.89.90.4189.5
29252935Zimbabwe2002.0Developing44.873.025.04.430.00000073.0304.026.340.073.06.5371.039.857.348340125525.01.21.30.42710.0
29262936Zimbabwe2001.0Developing45.3686.025.01.720.00000076.0529.025.939.076.06.1675.042.1548.58731212366165.01.61.70.4279.8
29272937Zimbabwe2000.0Developing46.0665.024.01.680.00000079.01483.025.539.078.07.1078.043.5547.35887912222251.011.011.20.4349.8